National Repository of Grey Literature 17 records found  1 - 10next  jump to record: Search took 0.01 seconds. 
Extraction of multilingual valency frames from dependency treebanks
Faryad, Ján ; Zeman, Daniel (advisor) ; Lopatková, Markéta (referee)
Multilingual valency dictionaries provide helpful information about correspon- dence of valency frames (verbs and their arguments) across various languages. This work aims at developing a program that automatically creates a multi- lingual valency dictionary, based on parallel treebanks annotated according to Universal Dependencies. This task includes monolingual extraction of va- lency frames and their cross-lingual linking. Various methods for solving the task are analysed and implemented. The work includes both general, language- independent approach and additional language-specific extensions, provided in particular for English, Czech and Slovak. The methods for linking the valency frames include using word alignment, morphological and syntactic information contained in the UD annotation or similarity of verbs between related languages. The quality of the solution is evaluated by multiple established metrics on man- ually annotated data or by comparison with an existing valency dictionary. 1
Automatic linking of lexicographic sources and corpus data
Bejček, Eduard ; Lopatková, Markéta (advisor) ; Horák, Aleš (referee) ; Žabokrtský, Zdeněk (referee)
Along with the increasing development of language resources - i.e., new lexicons, lexical databases, corpora, treebanks - the need for their efficient interlinking is growing. With such a linking, one can easily benefit from all their properties and information. Considering the convergence of resources, universal lexicographic formats are frequently discussed. In the present thesis, we investigate and analyse methods of interlinking language resources automatically. We introduce a system for interlinking lexicons (such as VALLEX, PDT-Vallex, FrameNet or SemLex) that offer information on syntactic properties of their entries. The system is automated and can be used repeatedly with newer versions of lexicons under development. We also design a method for identification of multiword expressions in a parsed text based on syntactic information from the SemLex lexicon. An output that verifies feasibility of the used methods is, among others, the mapping between the VALLEX and the PDT-Vallex lexicons, resulting in tens of thousands of annotated treebank sentences from the PDT and the PCEDT treebanks added into VALLEX. Powered by TCPDF (www.tcpdf.org)
Valency of Verbs in the Prague Dependency Treebank
Urešová, Zdeňka ; Hajičová, Eva (advisor) ; Lopatková, Markéta (referee) ; Ondrejovič, Slavo (referee)
Title: Valency of verbs in the Prague Dependency Treebank Author: PhDr. Zdeňka Urešová Department: Institute of Formal and Applied Linguistics MFF UK Supervisor: Prof. PhDr. Eva Hajičová, DrSc. Abstract: This dissertation describes PDT-Vallex, a valency lexicon of Czech verbs, and its relation to the annotation of the Prague Dependency Treebank (PDT). The PDT-Vallex lexicon was created during the an- notation of the PDT and it is a valuable source of verbal valency information available both for linguistic research and for computer- ized natural language processing. In this thesis, we describe not only the structure and design of the lexicon (which is closely related to the notion of valency as developed in the Functional Generative De- scription of language) but also the relation between the PDT-Vallex and the PDT. The explicit and full-coverage linking of the lexicon to the treebank prompted us to pay special attention to diatheses; we propose formal transformation rules for diatheses to handle their surface realization even when the canonical forms of verb arguments as captured in the lexicon do not correspond to the forms of these arguments actually appearing in the corpus.
Syntax of Adjective Compounds in a Bilingual Dictionary
Schmidtová, Iva ; Vachková, Marie (advisor) ; Hejhalová, Věra (referee)
The aim of the given thesis is to expound the problem of syntax processing by creating adjectival compounds in dictionary entries for the Large German-Czech Academic Dictionary. Assuming that exactly these words, which belong to the complicated vocabulary items, cause difficulties for foreign speakers with text comprehension and text production. It is based on the nature of German language, which makes many compounds, as opposed to in Czech language. Not only are the meanings of these words revealed to be problematic, but also their usage. The thesis is divided into two parts. The theoretical part deals with characteristics of adjectives as word class, and because each word always has its context, attention will be primarily focused on the adjectival valency. Previously, only the valency of verbs and substantives has been discussed. When a native speaker automatically knows these syntax structures and even when, in case of foreign language, there could exist structural similarities between the source and target language, the structures ought consequently to be given in the dictionary. In some mono- or bilingual dictionary the user should find complex informations for word searched. The practical part involves analysis of ten chosen adjective compounds, which shows how the valency structures are...
Lexicographic treatment of the valency aspects of verbal diatheses
Vernerová, Anna ; Lopatková, Markéta (advisor) ; Ivanová, Martina (referee) ; Petkevič, Vladimír (referee)
Název práce: Slovníkové zpracování valenčních aspektů slovesných diatezí Autor: Anna Vernerová Ústav: Ústav formální a aplikované lingvistiky Vedoucí disertační práce: doc. RNDr. Markéta Lopatková, Ph.D., Ústav formální a aplikované lingvistiky Klíčová slova: valence, diateze, pasivní participium Abstrakt: Diateze, a to jak ty tvořené pomocí pasivního participia (pasivum, prostý a posesivní rezultativ, recipientní diateze), tak i tzv. zvratné pasivum (deagentizace) byly v minulosti předmětem řady studií jak v bohemistické, tak i v mezinárodní lingvistice, pro češtinu ale dosud chybělo jejich důkladné slovníkové zpracování. V této dizertační práci se zabývám zachycením diatezí tvořených pomocí pasivního participia a s nimi příbuzných verbonominálních konstrukcí v gramatické komponentě valenčního slovníkuVALLEX. Vlastnímu tématu práce předchází krátký historický úvod a podrobné shrnutí pojetí valence ve Funkčním generativním popisu. Title: Lexicographic treatment of the valency aspects of verbal diatheses Author: Anna Vernerová Department: Institute of Formal and Applied Linguistics Supervisor: doc. RNDr. Markéta Lopatková, Ph.D., Institute of Formal and Applied Linguistics Keywords: valency, diathesis, passive participle Abstract: Diatheses have been the topic of a number of linguistic studies in Czech as...
Verbal Valency in a Cross-Linguistic Perspective
Šindlerová, Jana ; Lopatková, Markéta (advisor) ; Petkevič, Vladimír (referee) ; Malá, Markéta (referee)
Verbal Valency in a Cross-Linguistic Perspective Jana Šindlerová Abstract In the thesis, we look upon differences in argument structure of verbs considering the Czech language and the English language. In the first part, we describe the process of building the CzEngVallex lexicon. In the second part, based on the aligned data of the Prague Czech-English Dependency Treebank, we compare the valencies of verbal translation equivalents and comment of their differences. We classify the differences according to their underlying causes. The causes can be based in the linguistic structure of the languages, they can include translatological reasons, or they can be grounded in the character of the descriptive linguistic theory used.
Automatic linking of lexicographic sources and corpus data
Bejček, Eduard ; Lopatková, Markéta (advisor) ; Horák, Aleš (referee) ; Žabokrtský, Zdeněk (referee)
Along with the increasing development of language resources - i.e., new lexicons, lexical databases, corpora, treebanks - the need for their efficient interlinking is growing. With such a linking, one can easily benefit from all their properties and information. Considering the convergence of resources, universal lexicographic formats are frequently discussed. In the present thesis, we investigate and analyse methods of interlinking language resources automatically. We introduce a system for interlinking lexicons (such as VALLEX, PDT-Vallex, FrameNet or SemLex) that offer information on syntactic properties of their entries. The system is automated and can be used repeatedly with newer versions of lexicons under development. We also design a method for identification of multiword expressions in a parsed text based on syntactic information from the SemLex lexicon. An output that verifies feasibility of the used methods is, among others, the mapping between the VALLEX and the PDT-Vallex lexicons, resulting in tens of thousands of annotated treebank sentences from the PDT and the PCEDT treebanks added into VALLEX. Powered by TCPDF (www.tcpdf.org)
Linguistic Issues in Machine Translation between Czech and Russian
Klyueva, Natalia ; Kuboň, Vladislav (advisor) ; Panevová, Jarmila (referee) ; Strossa, Petr (referee)
In this thesis we analyze machine translation between Czech and Russian languages from the perspective of a linguist. We work with two types of Machine Translation systems - rule-based (TectoMT) and statistical (Moses). We experiment with different setups of these two systems in order to achieve the best possible quality. One of the questions we address in our work is whether relatedness of the discussed languages has some impact on machine translation. We explore the output of our two experimental systems and two commercial systems: PC Translator and Google Translate. We make a linguistically-motivated classification of errors for the language pair and describe each type of error in detail, analyzing whether it occurred due to some difference between Czech and Russian or is it caused by the system architecture. We then compare the usage of some specific linguistic phenomena in the two languages and state how the individual systems cope with mismatches. For some errors, we suggest ways to improve them and in several cases we implement those suggestions. In particular, we focus on one specific error type - surface valency. We research the mismatches between Czech and Russian valency, extract a lexicon of surface valency frames, incorporate the lexicon into the TectoMT translation pipeline and present...
The complement of the Spanish adjective
Koťátková, Adéla ; Čermák, Petr (advisor) ; Kratochvílová, Dana (referee)
The aim of this thesis is to analyse and describe the complement of the Spanish adjective. We clarify the heterogeneity of linguistic theories that approach this topic. We select the theory of valency in order to delineate the complement of adjective as a valence complement. We review its characteristics and compare the information in the quoted literature with the results of linguistic corpuses CREA and InterCorp. In the practical part, we analyse the corpuses for the combinations of adjective valency, obligation, the possibility of ellipsis, the influence of quantification on the complement and collocations. Key words: Spanish - linguistic corpus - InterCorp - CREA - valency - complement - adjective
Syntax of Adjective Compounds in a Bilingual Dictionary
Schmidtová, Iva ; Vachková, Marie (advisor) ; Hejhalová, Věra (referee)
The aim of the given thesis is to expound the problem of syntax processing by creating adjectival compounds in dictionary entries for the Large German-Czech Academic Dictionary. Assuming that exactly these words, which belong to the complicated vocabulary items, cause difficulties for foreign speakers with text comprehension and text production. It is based on the nature of German language, which makes many compounds, as opposed to in Czech language. Not only are the meanings of these words revealed to be problematic, but also their usage. The thesis is divided into two parts. The theoretical part deals with characteristics of adjectives as word class, and because each word always has its context, attention will be primarily focused on the adjectival valency. Previously, only the valency of verbs and substantives has been discussed. When a native speaker automatically knows these syntax structures and even when, in case of foreign language, there could exist structural similarities between the source and target language, the structures ought consequently to be given in the dictionary. In some mono- or bilingual dictionary the user should find complex informations for word searched. The practical part involves analysis of ten chosen adjective compounds, which shows how the valency structures are...

National Repository of Grey Literature : 17 records found   1 - 10next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.